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Abstract. We study statistics dependence of the probability distributions and the 
means of measured moments of conserved quantities, respectively. The required 
statistics of all interested moments and their products are estimated based on a simple 
simulation. We also explain why the measured moments are underestimated when 
the statistics are insufficient. With the statistics at RHIC/BES, the second and third 
order moments can be reliably obtained based on the method of Centrality bin width 
correction (CBWC), which can not be applied for the fourth order moments at low 
energy. With planning statistics at RHIC/BES H, and improved CBWC method, Ka^ 
in a finer centrality bin scale should be measurable. This will help us to understand 
the current observation of energy and centrality dependence of high-order moments. 


1. Introduction 

The main goal of the RHIC Beam Energy Scan (BES) program is to search for 
the Quantum Chromodynamics (QCD) critical point and phase boundary [1]. The 
calculations from lattice QCD and QCD based models have shown that the high-order 
moments of conserved quantities, such as net-baryon, net-charge and net-strangeness, 
are sensitive to the critical fluctuations El El 111 El El d. So the behavior of high-order 
moments of conserved quantities are greatly interested in heavy ion collisions [HiniEo]. 

However, the moments of conserved quantities are also affected by many non- 
critical backgrounds m, such as global baryon/charge conservation. The moments 
are substantially suppressed if global conservation is taken into consideration |T2]. In 
addition, artihcial cuts of phase space in experimental analysis are not trivial [TB] , such 
as the acceptance in rapidity and transverse momentum space, the selection of centrality 
of the collisions, and so on. 
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To reduce those non-critical background contributions, many new techniques are 
applied in moments analysis at RHIC/STAR [TTl ITS] . The Delta theorem method is 
used for estimation of statistical uncertainties [I6]. The track efficiency correction m 
is hrstly considered in event-by-event fluctuation. The initial size fluctuation is one of 
the important non-critical effects. Although it has not been taken into consideration 
in most of the theoretical calculations [181 m 120] . it exists in all event variables in 
experiments. As we know, the initial size is usually quantihed by the centrality of the 
collisions, which is determined by the multiplicity {Nch) of the hnal state. A centrality 
bin corresponds to a range of multiplicity. For a given centrality bin, i.e., a range of 
multiplicity, the initial size still fluctuates from event to event |2T]. In order to reduce 
this initial size fluctuation, it is proposed to calculate the moment in each of Nch- The 
measured moment is averaged over all Nch in a given centrality, where the average is 
weighted by the number of events in each of Nch- This is so called the method of the 
Centrality Bin Width Correction (CBWC) [22] . 

The CBWC method can only be reliably used if the statistics in each of Nch is 
large enough [23] . It is shown that measured moments increase with statistics and 
hnally saturate when statistics is sufficient. If the statistics is insufficient, the measured 
moments will be underestimated. In fact, it is already found that the fourth order 
moments would be underestimated at RHIC low energies. In order to increase the 
statistics, it is suggested to calculate the moment at Sl% centrality bin width, instead 
of each of Nch, in the CBWC method, i.e., so called the improved CBWC method [23] . 
It can reduce the initial size fluctuations as well. 

The published STAR results show of net-proton multiplicity distributions in 
central collisions (0-5% and 5-10% centrality) is systematically lower than that in the 
other centralities The behavior of na'^ in a hner centrality bin (such as 51% 

centrality) should be further investigated. At planning RHIC/BES II, the statistics will 
be increased to a few hundreds millions, see Table Era- With these large statistics, 
and improved CBWC method, it is possible to study the centrality dependence of high- 
order moments in a hner centrality bin scale, i.e., in each 51% centrality bin width for 
central collisions (0-10% centrality). If this can be done, the centrality dependence of 
high-order moments can be understood better. 


Energy (GeV) 

7.7 

9.1 

11.5 

13.0 

14.6 

19.6 

No. Events (Million) 

100 

160 

230 

250 

300 

400 


Table 1. The proposed statistics at RHIC/BES II. 


In this paper, we hrstly study the statistics dependence of probability distribution 
of measured moments in section II. It is shown that when the probability distribution of 
measured moments can be described by a normal one with ht goodness < 10, the 

corresponding statistics is enough for the moment analysis. The required statistics in 
each of Nch or 51% centrality bin width for all interested moments and their product is 
estimated. The statistics dependence of the means of measured moments are studied in 
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section III. The required statistics is obtained when the mean of the measured moment 
is saturated to its expectation. By this way, the required statistics is lower than those 
obtained from probability distribution of measured moments. It also explains why the 
measured mean of moment is systematically smaller than its expectation when the 
statistics is short. 

According to the statistics dependence of measured moments given in section II and 
III, the reliability of measured moments in current RHIC/BES I, and planed RHIC/BES 
II are discussed in section IV. Finally, the summary is presented in Section V. 

2. Statistics dependence of probability distributions of measured moments 

It is well known that the probability distribution of the measurement should be a normal 
one in the case of the sufficient statistics. For the statistics at current heavy ion collisions, 
usual observables, such as, K/ti fluctuation [25], and the mean of various multiplicity, 
whose probability distributions are already normally distributed. 

However, for high-order moments, in particular, the fourth, and even the sixth order 
moments, the required statistics should be largely increased. How much they should be 
is not yet clear. So in this section we firstly demonstrate how they change with statistics 
from a simple simulation. 

Let’s suppose that the particle and anti-particle are produced independently by 
a Poisson distribution. Then the net-particle follows a Skellam distribution. The 
parameters of Skellam distributions are the means of particle and anti-particle |26l 
177112S] . In our simulation, they are set to mi = 14.5, m 2 = 0.6, reference to the means 
of proton and anti-proton at low energies of RHIC/STAR [14] . 

In Figure the distribution of mean (V), variance a, and skewness S of 100 
events of Skellan distribution are shown. The distributions are obtained from 5 million 
(5M) independent samples and presented by black points in Figure [^a), (b) and (c), 
respectively. The yellow dashed lines are fitted by a normal distribution. 

We can see from Figure [^a) that the distribution of measured mean {N) is already 
a normal one. Its fit goodness is 1.48 as listed in Table So the statistics 100 

seems enough for first order moment analysis. While, the distribution of the second 
order moment a in Figure [^b) is slightly deviated from normal distribution, and its 
X^/ndf = 46.3. The distribution of third order moment S in Figure [^c) can not be 
fitted by a normal one any more with y^/nd/ = 228.9. So the higher the order of 
moments, the larger statistics are required. 

Increasing the statistics to magnitude 1,000, and 10,000, we re-calculate the second 
and third order moments. Their probability distributions are presented in the upper 
and lower panel of Figure]^ respectively. The left and right columns correspond to the 
statistics 1,000, and 10000, respectively. From Figure [^a) with statistics 1,000, we can 
see that the distribution of a can be fitted by a normal one with x^/'^df = 4.96. And 
Figure l^b) with statistics 10,000 gives an better normal distribution with x^/^df = 
1.55. 
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Figure 1. (Color online) Probability distributions of measured mean (N), variance 
CT, and skewness S of 100 events produced by Skellam distribution (black points). The 
yellow dashed lines are corresponding normal fittings. 


Statistics 

100 

1000 

10000 

100000 

IM 

X^/ndf {Mean) 

1.48 

1.08 

1.05 

0.99 

1.00 

X^/ndf {a) 

46.3 

4.96 

1.55 

0.97 

0.95 

xVndf (S) 

228.9 

56.9 

7.97 

2.08 

0.96 

X^/ndf (n) 

4962 

1955 

321 

37.6 

4.00 


Table 2. of normally fit for probability distributions of (IV), cr, S and k with 

statistics 100, 1000, 10000, 100000 and IM, respectively. 


Therefore, it seems that we can relax the usual requirement of ht goodness from 
X^/ndf ~ 1 to ~ 10 iii our simulation. In this case, if the distribution of 

measured moments is htted by a normal one with £t goodness y^/nd/ < 10, the 
corresponding statistics is sufficient for a reliable analysis. By this standard, the 
statistics 1000 seems enough for the second order moment analysis. It is a magnitude 
larger than that of the hrst order moment. 

Figure |^c) shows the probability distribution of the third order moments with 
statistics 1000. It can still not be well described by a normal one. The fit goodness is 
X^/ndf = 56.9, which is still larger than 10. When statistics is increased to 10,000, i.e.. 
Figure l^d), it can be htted by a normal one with y^/nd/ = 7.97. So the statistics for 
the third moments are 10,000. It is again a magnitude larger than that of the second 
moment. 

Obviously, the statistics for fourth order moment, k, has to be larger than 10,000. 
From Table we can see that when statistics increases to 1 million (IM), its ht is 
goodness /ndf = 4.00. So the statistics for the fourth order moment is around IM. 

The product of the fourth and second order moment, kct^, is another interested 
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Figure 2. (Color online) Probability distributions of the measured a (upper panel), 
and S (lower panel) with statistics: 1,000 (left column), and 10,000 (right column), 
respectively. The yellow dashed lines are the corresponding normal fittings. 


observable [22] • Its statistics should be mainly determined by the fourth order moment 
K. The probability distributions of measured product with statistics O.OIM, O.IM and 
IM are presented in Figure [^a), (b) and (c), respectively. The yellow dashed lines 
are fitted normal distributions. It shows clearly that figure 3(a) can not be fitted by a 
normal distribution, and Figure |^b) is still slightly deviated from it. Only Figure [^c) 
with statistics IM can be htted by a normal one. Consequently, the statistics of are 
IM, the same magnitude as those of k. 

3. Statistics dependence of the means of measnred moments 

It is known when the statistics is large enough, the mean of measured moments should 
be saturated and independent of the statistics, i.e., 

= (X)n, = (A')„, , (1) 

where X is the measured moments of (Af), a, S and k. The subscripts rii, n 2 and 
denote different statistics. 
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Figure 3. (Color online) Probability distributions of kct^. 


In previous study 
to its expectation is: 


, we defined the difference of the mean of measured moments 


d = 


- (■’f) 


th 


X 100%, 


( 2 ) 


where is theoretical expectation and 


IWtJ 

is experimentally measured mean 
of moments with hnite statistics. Consider the statistics at RHIC/BES I and its 
uncertainty of the measurements, the result is acceptable if d < 5% [23]. The statistics 
at RHIC/BES II has a few hundred millions, which is 1-2 magnitudes larger than those 
at RHIC/BES I. So the statistical uncertainty will be largely reduced. In this case, 
d < 1% is more reasonable within smaller uncertainties at planning RHIC/BES H. 

The left panel of Table lists statistics dependence of {a) and corresponding 
difference d, respectively. It shows that the difference d is already less than 1% when 
statistics is 100. This statistics is less than 1,000, which is obtained in the above section 
by requiring a normal distribution with the £t goodness x^/^df < 10. So the normal 
distribution of the measured moment is a more restrictive requirement for the statistics. 


Statistics 

100 

500 

1000 

Statistics 

1000 

2000 

5000 

10000 


3.85629 

3.87996 

3.88286 


0.893 

0.946 

0.979 

0.990 

d(%) 

0.76 

0.15 

0.077 

d(%) 

10.7 

5.45 

2.15 

0.97 


Table 3. Statistics dependence of (cr) and and the difference to their saturated 

value {d). 


In Figure l^a) and|^b), the statistics dependence of (kct^) for two sets of Skellam 
parameters: (a) mi = 14.5 and m 2 = 0.6, and (b) mi = 79.0 and m 2 = 65.0 (reference to 
the means of charge and anti-charge within RHIC/STAR acceptance [121) are presented. 
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respectively. With increase of statistics, in both Figure |^a) and Figure |^b) 

increases hrstly and then converges to their expectation unity. While, it converges more 
quickly in Figure |^a) than that in Figure |^b). So the required statistics is related to 
parent distribution. 




Figure 4. (Color online) Statistics dependence of (kct^), which are all averaged 
from 5M independent samples randomly generated from the simulations of Skellam 
distributions with the input parameters: (a) mi = 14.5 and m 2 = 0.6 and (b) mi = 79 
and m 2 = 65, respectively. 

The statistics dependence of a few typical in Figure |^a) is given in the right 

panel of Table It shows that the difference d is less than 1%, when statistics is 
above 10,000. It is about two magnitudes smaller than that obtained from the above 
section. So the normal distribution of the measured moment is also more restrictive for 
high-order moments. 

Figure |^a) and ib) demonstrate that the mean of measured moments are 
underestimated, when statistics is insufficient. The poorer the statistics, the smaller 
the mean. To search for the reasons, the probability distributions of measured at 
statistics 100, 500 and 10000 are presented in Figure [^a), |^b), andj^c), respectively. 
The black dashed lines shows the peak of the distributions. The blue lines are the 
mean of their probability distribution, i.e., The red solid lines are theoretical 

expectation. 

Figure [^a) shows that the peak(black dashed line) and mean(blue line) of the 
distribution are on the left side of the expectation (unity). The probability on the 
left side of unity is systematically higher than that on the right side. So the measured 
moment is underestimated when the statistics is poor. This is due to the lost information 
of the tail of net-particle multiplicity distributions, which can be found from Figure 
Figure a), (b) and (c) show the net-particle multiplicity distributions with statistics 
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Figure 5. (Color online) Probability distributions of kct^ with statistics 100, 500 
and 10000 shown from panel (a) to (c), respectively. 


1,000, 10,000, and O.IM. These three samples are randomly selected from the snb- 
samples using for calculating moments in this paper. It shows that the tail of the 
net-particle multiplicity distributions is dependent on the statistics. With insufficient 
statistics, the tail of net-particle multiplicity distributions shown in Figure |^a) and 
(b) can not be well detected. The smaller the statistics, the poorer information about 
the tail of the net-particle multiplicity distributions we can detect. With the statistics 
increased, this information can be better shown up, such as shown in Figure |^c). That 
is why decreases as the statistics decreases in case of insufficient statistics shown 

in Figure 

The influence of tail of net-particle multiplicity distributions on measured moments 
is similar to what had been observed at measurement of sixth order moment, where the 
tail of the net-proton multiplicity distributions is artihcially truncated [30] • It also shows 
that the more the lost information at the tail of the distribution, the smaller Cq/C 2 is 
measured. 

If the statistics is sufficient, the peak, mean and expectation, i.e., the three lines in 
Figure should be identical. So we can see from Figure [^a) to[^b), and hnally to[^c) 
that they become closer and closer. 

In addition, the statistical uncertainty of Figure i.e., the widths of probability 
distributions, look much larger than the deviation of the average (blue dashed lines in 
Figure]^ from the expectation (red solid lines). However, it is the statistical uncertainty 
of one sub-sample, similar to the experimental measurement at a hxed multiplicity. An 
experimental point at given centrality is averaged over all multiplicities in the centrality, 
analogous to the case of Figure 4. The corresponding statistical uncertainty is usually 
smaller than those given in Figure 5. Moreover, it has been shown in Figure 1 in Ref. [23] 
that the fluctuation of many sub-samples are out of 3a of expectation. So the deviation 
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Number of Net-proton 


Figure 6. Net-particle multiplicity distributions with statistics 1,000, 10,000 and 
O.IM shown from panel (a) to (c), respectively. 


of average from its expectation due to insufficient statistics can not be ignored. 

4. Measurement of at RHIC/BES I and II 

Up to now, we only consider the required statistics for a given distribution. In fact, 
this is corresponding to statistics in each of Nch with CBWC method, or in each 61% 
centrality bin with improved CBWC method. The statistics of real sample in minimum 
bias has to be quite larger than it. For example, the required statistics for (kct^) of 
net-charge is about O.IM (cf. Figure]^ and Table [^, while the corresponding statistics 
in whole minimum bias sample is around 200M assuming there is 100 multiplicity bin in 
0-5% centrality [31]. That is why we propose the improved CBWC method on moment 
analysis. 

As we have mentioned in motivation, another advantage of the improved CBWC 
method with each 51% centrality bin is that we can further investigate why in central 
collisions is systematically lower than the other centralities. With tens of millions of 
statistics at RHIC/BES I, the number of events in each 51% is around O.IM, similar 
statistics as Figure [^b). If we study behavior of in each 51% centrality bin, 
the statistical uncertainties shown in Figure [^b) is around 0.29. Consequently, the 
statistical uncertainties has already larger than the measured difference of centrality 
dependence of shown in Ref. d- 

At RHIC/BES H, the statistics in minimum bias will be increased to 100-400 
millions, cf. Table [Tj The statistics in suggested 51% centrality bin will be about 
1-4 millions. The width in Figure [^c) is about 0.1, which would be slightly larger 
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than statistical uncertainties at RHIC/BES II. So in this case, the statistics should be 
sufficient for analysis in each 61% centrality bin. This will be helpful to clarify why 
measured of net-proton multiplicity distributions in central collisions (0-5% and 
5-10%) is much lower than those in the other centralities. 

5. Summary 

In summary, the moments of conserved quantities are important measurements to 
study QCD phase structure and bulk properties. If the statistics is insufficient, the 
shape of net-proton/net-charge multiplicity distributions, especially for the tail of the 
distributions, can not be well detected. Consequently, the measured moments are 
underestimated and the obtained moments probability distributions would be deviated 
to normal distribution. 

In this paper, statistics dependence of the probability distributions and the means of 
measured moments and their product Ka‘^ are systematically presented. The behavior 
of simulation in this paper can be referenced to moments of net-proton multiplicity 
distributions at RHIC low energies. It is found that when the probability distribution of 
the measured moment can be htted by a normal one with £t goodness < 10, the 

corresponding statistics is enough for the moment analysis. The required statistics can 
also be obtained when the mean of the measured moment is saturated to its expectation 
within statistical error. By this way, the required statistics is 1-2 magnitude less than 
those obtained from probability distribution of measured moments. So the normal 
distribution of the measured moment is a more restrictive requirement for the statistics. 

Currently, the CBWC method in each of Nch is employed to moments analysis. 
With the restrictive requirement of the normally probability distribution, it is found 
that the statistics at RHIC/BES I is enough to extract the second and third order 
moments. However, at RHIC low energy, the statistics is not enough for either 
from the restrictive normally distribution or quite looser requirement obtained from 
the saturation of the mean of measured moments. The measured moment will be 
underestimated if statistics are not sufficient. With the improved CBWC method in 
each 61% centrality bin, we can obtain much reliable at RHIC low energies. 

The current measured Ka^ of net-proton multiplicity distributions shows that Ka‘^ 
in central collisions is systematically lower than that in the other centralities. With 
planning statistics for RHIC/BES H, and improved CBWC method, it is able to measure 
the high moments of conserved quantities in finer centrality bin scale. This will help 
us to understand the observation of the centrality and energy dependence of high-order 
moments. 
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